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Abstract 

We study the palindrome complexity of infinite sequences on finite alphabets, i.e., 
the number of palindromic factors (blocks) of given length occurring in a given se- 
quence. We survey the known results and obtain new results for some sequences, in 
particular for Rote sequences and for fixed points of primitive morphisms of constant 
length belonging to "class P" of Hof-Knill-Simon. We also give an upper bound for the 
palindrome complexity of a sequence in terms of its (block-)complexity. 



1 Introduction 

The (block-) complexity function of an infinite sequence on a finite alphabet is the number of 
factors (blocks) of given length occurring in this sequence. This notion was introduced in 1975 
by Ehrenfeucht, Lee, and Rozenberg |^. The complexity function of a sequence measures 



in some sense how "complicated" the sequence is: the reader is referred to the surveys 
0, |2^, |3^. Due inter alia to its applications to physics [0, another interesting "complexity" 



for an infinite sequence on a finite alphabet is its palindrome complexity, i.e., the number 
of palindromic factors (blocks) of given length occurring in the sequence. Combinatorial 
results on palindrome complexity for some sequences or classes of sequences were proved in 
P^ , ^, ^ |TD|, ^ . We survey known results, and give new ones, proving for example that 
the palindrome complexity of Rote sequences is constant and equal to 2. We prove that the 
palindrome complexity function of a sequence that is a fixed point of a primitive morphism of 



length d belonging to "class P" of Hof-Knill-Simon and satisfying some technical conditions 
is d- automatic. Finally, we give an upper bound for the palindrome complexity in terms of 
the (usual) complexity. 

2 Definitions and notations 

2.1 Generalities 

We will use the notation N = {0, 1,2, . . .}. We recall the following classical definitions in 
combinatorics of words. The sequences we consider in this paper are defined on a finite 
alphabet (i.e., on a finite set) A. Elements of A are called letters. The set A* is defined as 
the set of words on A, i.e., the set of (possibly empty) strings of symbols of A, equipped with 
the concatenation. In other words. A* is the free monoid for the concatenation generated by 
A. The length of a word w, denoted by 1^1, is recursively defined by: the empty word has 
length 0, and for any word w and any letter a, \wa\ = \w\ + 1. A word is called a factor of 
another word or of an infinite sequence if it occurs "without hole" in this word or sequence 
(factors are also called subwords, while the term substrings stands for the case where there 
are holes: 001 is a factor of 11001110 but only a substring of 0101010). 

If A and B are two alphabets, homomorphisms for the concatenation from A* to B* 
are called morphisms. A morphisni on A (also called substitution or inflation rule) is a 
morphism from A* into itself. A morphism is defined by its values on the letters. A constant 
length morphism (or uniform morphism) is a morphism such that the images of all letters 
have the same length. If this length is equal to d, the morphism is also called a morphism 
of length d, or a d-morphism. A sequence on A is called d-automatic if it is the pointwise 
image of a fixed point of a morphism of length d on an alphabet B (pointwise image means, 
of course, image under a morphism of length 1 from ,B to ^). A morphism 9? on ^ is called 
primitive if there exists an integer k > 1 such that the image of each letter by (p'' contains 
at least one occurrence of each letter of A. A morphism is called non-erasing if the image 
of every letter is different from the empty word. 

A (non-erasing) morphism on A can be extended to infinite sequences with values in A "by 
continuity". (The set of sequences A^ is equipped with the topology of simple convergence, 
i.e., the product topology where each copy of A is equipped with the discrete topology.) An 
infinite sequence can thus be a fixed point of a morphism. 

An infinite sequence on a finite alphabet is called recurrent if each word that occurs in 
the sequence occurs infinitely often. The sequence is called uniformly recurrent or minimal 
if each word that occurs in the sequence occurs infinitely often and the distance between 
two consecutive occurrences is bounded (some authors use the term almost-periodic for such 
sequences, while some authors call them repetitive). It is easy to prove that a sequence that 
is a fixed point of a primitive morphism is uniformly recurrent. 

2.2 Periodicity 

In this section we recall the definition of periodic sequences or words, and we give two 
theorems and a lemma that will prove useful. 
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Definition 1 



• An infinite sequence u = uqUi ... is called periodic if there exists an integer T > 1 
(called a period of the sequence) such that for each n > we have Un+r = Un- It is 
called ultimately periodic if there exists an integer C. such that the sequence v defined 
by Vn = ue+n is periodic. 

• Let w be a finite word. Any integer p > 1 such that w is a prefix of an infinite sequence 
of period p is called a period of w. The period of w is the smallest such integer. (For 
example a period of the word 01101 is 5. Its period is 3.) 

Theorems |l| and |^ below will prove useful. They are due respectively to Fine and Wilf 
|JT[ and to Lyndon and Schiitzenberger [R^. 



Theorem 1 (Fine- Wilf) Let u = {un)n>o and v = (f„)„>o be two periodic sequences with 
respective periods T and T' . If Un = Vn for more than T + T' — gcd(T, T') consecutive values 
of n, then the sequences u and v are equal. The value T + T' — gcd(T, T') is sharp. 



Remark 1 In the literature an easy corollary of this result is often called the theorem of 
Fine and Wilf, namely that if a finite word w has two periods T and T' such that \w\ > 
T + T' — gcd(T,T') , then gcd(T,T') is a period of w . The value T + T' — gcd{T,T') is sharp. 



Theorem 2 (Lyndon-Schiitzenberger) Let A be an alphabet. Let x,y,z G A*, with x 
and z non-empty. Then xy = yz if and only if there exist u,v G A* , and an integer e > 
such that X = uv, z = vu, and y = {uvYu = u{yuY . 



Lemma 1 

• // the period T of a word w satisfies T < 1^/^1/2, then all the periods of w that are 
< \w\/2 are divisible by T. 

• Let z be a word and let w be a factor of z. If z has period T, if w has period T' , and 
ifT + T' < \w\, then T' is a period of z. 

• Let z be a word and let w and w' be two factors of z. Suppose that T , the period of w, 
T' , the period of w' , and 9, the period of z, satisfy 9 + T < \w\ and 9 + T' < \w'\. 
Then T = T'. 

Proof. 

• If T is the period of w, if U is another period, then w is a prefix of a T-periodic 
sequence and of a (possibly distinct) [/-periodic sequence. If both periods are < 1^^1/2, 
then T + U < \w\. Hence the two infinite periodic sequences, coinciding on a prefix of 
length > T + [/, must be equal (from Theorem |l|). The periodic sequence thus obtained 
admits T as smallest period (namely T is the smallest period of the word w). Hence 
T divides U . (Actually it can also be noted that if T > |w|/2, then our statement is 
empty, hence holds.) 
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• The word 2; is a prefix of a T-per iodic sequence m, and tlie word w is a prefix of a 
T'-periodic sequence u' . But tlie word w is a factor of z, lience a prefix of a sequence of 
period T, say -u, obtained from the sequence u by erasing some prefix. Since T+T' < \w\ 
we have, from Theorem |l], that u' = u. Hence T' is a period of u, since u can be obtained 
by erasing a prefix of u. Hence T' is a period of z. 

• From the second item above, T and T' are periods of z. Hence clearly T' is a period 
of w and T is a period of w'. Since T is the minimal period of w and T' the minimal 
period of w', we have T <T' and T' < T. Hence T' = T. □ 



2.3 Palindromes and complexity 

Definition 2 If w = W1W2 ... is a word on the alphabet A, we denote by w the word 
obtained by reading w backwards, i.e., w = WjWj^i . . . W2W1. A palindrome is a word w such 
that w = w. (For example the words "level" and "deed" are palindromes in English.) 



Definition 3 Let u := uqUiU2 ... be a sequence on the finite alphabet A. We denote by 
faCu(n) the number of words of length n that are factors of the sequence u. We denote by 
pal„(n) the number of palindromes of length n that are factors of the sequence u. 



Remark 2 The notations used in other papers might differ. In the literature Pu{n) some- 
times stands for the block-complexity and sometimes for the palindrome complexity. We 
hope that our terminology is unambiguous. 



2.4 Sturmian sequences 

We end this section by recalling the definition of Sturmian sequences. These sequences can 
be obtained by playing billiard on squares, starting with an irrational slope. They can also 



be defined by their complexity. The reader is referred to pT|, |18|, B^, O 



Definition 4 A Sturmian sequence is a sequence u = {un)n>o whose (block-)complexity 
faCu(fc) satisfies: V/c > 1, fa.Cu{k) = k + 1. 



3 Motivation in physics 

Given a uniformly recurrent sequence u, one may consider the associated Ll-class Q (also 
called hull or induced subshift) which consists of all two-sided infinite sequences that have 
the same finite factors as u. If u is Sturmian, if u is generated by a primitive morphism, 
or if u is derived from a standard cut and project scheme, this gives widely used models 
of one- dimensional quasicrystals. That is, a one-dimensional quasicrystal is modelled by a 
suitable family of two-sided sequences which are locally indistinguishable since they have the 
same finite factors. 
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To such a structure, one may associate a family of discrete one-dimensional Sclirodinger 
operators {H^)^^q as follows: choose an injective function f : A ^ M. and define, for every 
u & Q, the operator in P{Z) by 

= 0(n + 1) + 0(n - 1) + 

The spectral properties of determine the "conductivity properties" of the given structure. 
Roughly speaking, if the spectrum is absolutely continuous, then the structure behaves like 
a conductor, while in the case of pure point spectrum, it behaves like an insulator. The 
intermediate spectral type - singular continuous spectrum - is generally expected to give rise 
to intermediate transport properties, but no one currently understands this correspondence 
very well. 

For classical (periodic or atomic) structures, singular continuous spectra do not occur. 
However, for one-dimensional quasicrystals, this spectral type appears to be rather typical, 
and there has been a lot of recent research activity focussing on results of this kind. One 



important contribution in this direction is the paper of Hof, Knill, and Simon where 
a sufficient criterion for purely singular continuity of individual such operators is derived in 
terms of a strong palindromicity property of the underlying sequence. 

From a combinatorial point of view, it is interesting to note that the criterion of Hof, 
Knill, and Simon deduces singular continuous spectrum from explicit combinatorial (and 
dynamical) properties of u. Suppose that the factors of u occur with well-defined, positive 
frequencies {Q is strictly ergodic). This is the case for a large class of sequences, including 
Sturmian sequences and sequences generated by primitive morphisms, but also for sequences 



derived from the standard or generalized cut and project schemes [0. Then the following 
holds m. 



Theorem 3 (Hof-Knill-Simon) Let u be a sequence on a finite alphabet that is not ul- 
timately periodic (hence fac„(A;) > k + 1 for every k > 1), and such that pal„ does not 
ultimately vanish, i.e., \imsupf.^^'pal^{k) > 0. We suppose that the subshift Q induced by u 
is strictly ergodic. Then, for uncountably many u E Q, the operator has purely singular 
continuous spectrum. 

This theorem applies to a large class of sequences generated by primitive morphisms, 
to all Sturmian sequences, and, more generally, to all sequences defined by circle maps. 
Furthermore, it also applies to sequences derived from a standard cut and project scheme 
with inversion-symmetric window (see |]TD|] for details). Note that by uniform recurrence 



of u, it suffices to assume that u is not periodic. We remark that there is a similar, purely 
combinatorial, sufficient condition for purely singular continuous spectrum in terms oi powers 



occurring in u, see |20] for a survey. 



4 Survey of known results 

4.1 Rudin-Shapiro and paperfolding sequences 

Paperfolding sequences are binary sequences obtained by repeatedly folding a strip of paper 
(see for example). We recall the definition. 
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Definition 5 A sequence («„)„>! with values in {0, 1} is called a paperfolding sequence 
if there exists a sequence io,ii,i2, ■ ■ ., with G {0,1} (called the sequence of unfolding 
instructions) such that 

Vm > 0, Vj > 0, M2™(2j+i) = J + i'm mod 2. 
(Note that every integer n > 1 can be uniquely written as n = 2™'(2j + 1), with m,j > 0.) 



Generalized Rudin-Shapiro sequences (in the sense of 
modulo 2" paperfolding sequences. More precisely 



are obtained by "integrating 



Definition 6 A sequence (fn)n>o is called a generalized Rudin-Shapiro sequence if vq = 
and if there exists a paperfolding sequence («„)„>! such that, for every n > 0, there 
holds Vn = ^fc=i Uk mod 2. (The "classical" Rudin-Shapiro sequence corresponds to the 
paperfolding sequence with unfolding instructions 0, 0, 1, 0, 1, 0, 1, 0, 1, . . .) 

Remark 3 There exist other generalizations of the classical Rudin-Shapiro sequence (see, 
e.g., P, 



but we restrict here to the ones given in Definition ^ above. 



The following theorem was proved in [Q. It was proved again in a more efficient and 
general way by Baake in |TD . 



Theorem 4 (Allouche) 

The palindrome complexity of any paperfolding sequence u = {un)n>i satisfies pal„(A;) = 
for any k > 14. 

The palindrome complexity of any generalized Rudin-Shapiro sequence v = (f„)„,>o satis- 
fies pal^(A;) = for any k > 15. 



4.2 The period-doubling sequence 

Definition 7 The period- doubling sequence is defined as the infinite fixed point of the mor- 
phism 01, 1 ^ 00. 



The following result was proved in |]T^ . 

Theorem 5 (Damanik) The palindrome complexity of the period- doubling sequence u sat- 
isfies 

Wk even, > 4 =^ pal„(A;) = 0, 

Vfc odd. A; > 5 paljk) = pal„(2A; - 1) = pal„(2A; + 1), 
and the first few values of pal„(A;) are given by 



k 


1 


2 


3 


4 


5 


6 


7 


pal„(/^) 


2 


1 


3 





4 





3 



In particular, the function k — > pal„(A;) takes its values in the set {0, 1, 2, 3, 4}. Furthermore, 
lim supfc^oo pal„(/c) = 4. (Actually 0, 3, and 4 are the only values that are taken infinitely 
often.) 
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4.3 Sturmian sequences 

The following nice characterization of Sturmian sequences in terms of palindrome complexity 
was given in (see also |^ for the Fibonacci sequence, which can be defined as the fixed 



point of the morphism — > 01, 1 ^ 0). 

Theorem 6 (Droubay-Pirillo) A sequence u = (M„)n>o is Sturmian if and only if its 
palindrome complexity satisfies: Wk odd, pal„(/c) = 2 and Wk >2 even, pal„(/c) = 1. 

Remark 4 A generalization of this theorem to the two-dimensional case has been obtained 



recently by Berthe and Vuillon |TB|. For a study of palindromes in episturmian sequences, 
see pSl, |33|. 



4.4 Fixed points of primitive morphisms 

Primitive morphisms are often considered because they have dynamical or combinatorial 



properties that other morphisms may not have. The following result was proved in 



Theorem 7 (Damanik-Zare) The palindrome complexity pal„(A;) of a fixed point u = 
{un)n>o of a primitive morphism is bounded (hence takes only finitely many values). 

Remark 5 As a consequence of Theorem |T^ below, we will have that the conclusion of 
Theorem |^ also holds for uniform (not necessarily primitive) morphisms. 

5 Rote sequences 



Sequences of (block-)complexity 2k were studied in [^. In that paper. Rote proved in 
particular ^B|, Theorem 3] that an inGnite 0, 1-sequence w = (m„)„>o is a complementation- 
symmetric sequence with block-complexity 2k if and only if its first difference (modulo 2 ) 
sequence [3 = {Pn)n>o, is Sturmian, where, for each n > 0, (3n '■= Wn+i — Wn mod 2. Recall 
that a complementation-symmetric sequence on a two-letter alphabet, say A = {a,b}, is a 
sequence such that for any block occurring in it, the block obtained by changing a's into 6's 
and 6's into a's is also a factor. Using Theorem ^we can compute the palindrome complexity 
of these sequences. 

Theorem 8 Let w = (w„)„>o be a complementation- symmetric sequence with complexity 
facw{k) = 2k for all k > 1. Then its palindrome complexity satisfies pal^(/c) = 2 for all 
k>l. 

Proof. 

Since fac^(l) = 2, we see that the sequence w is a binary sequence. Without loss of 
generality we may assume that it is a 0, 1-sequence. Define /3 = (/3„)„>o by: > 
Pn '■= u)n+i — Wn mod 2. Then we know that the sequence /3 is Sturmian. 



7 



Define for eacli k > 1 the maps $fc and (wfiere s = 0, 1) on words of length k on {0, 1} 

by 

$fe(aia2 . . . flfe) := &1&2 • • • &A;-i, where 6, := a^+i - mod 2, 

j 

\l/|(6i62 ■ ■ - bk) '■= SC1C2 . . . Cfc, where s = 0, 1 and Cj := s + bi mod 2. 

j=i 

It is straightforward if k is even that sends palindromes of length k to palindromes 
of length k — 1 whose central letter is 0, and that any palindrome vr of length k — 1 whose 
central letter is is the image under oi exactly two palindromes of length k, namely 
^Uin) and vl/ti(7r). 

It is also straightforward if k is odd that $fc sends palindromes of length k to palindromes 
of length k — 1, and that any palindrome tt of length A; — 1 is the image under of exactly 
two palindromes of length k, namely \l/°_i(7r) and \E'^_]^(7r). 

Now from this property of the map $fc and from the relation between the sequences w 
and (3, we see that the number of palindromes of length k occurring in the sequence w is 
equal to twice the number of palindromes of length k — 1 whose central letter is occurring 
in /3 if /c is even, and to twice the number of palindromes of length k — 1 occurring in f3 if k 
is odd. 

To conclude the proof it remains to note that any Sturmian sequence has exactly one 
palindrome of length k ii k is even , and exactly one palindrome of length k whose central 



letter is if is odd: the proof of Proposition 6 of shows there is a bijection between 
the set of palindromes of length k + 2 and the set of palindromes of length k and that this 
bijection consists of erasing the first and last (identical) letters. Since there is exactly one 
palindrome of length 1 with central letter and one palindrome of length 1 with central 
letter 1, we see immediately that, for each odd k, any Sturmian sequence has exactly one 
palindrome of length k whose central letter is and exactly one palindrome of length k 
whose central letter is 1. □ 

Remark 6 

• Another way of studying palindromes occurring in a sequence is to use the associated 
dynamical system and its (geometric) symmetry properties. In this direction, the 
reader can look at [0] (see for example Theorem 19 in that paper) and [|1^ (where, in 
particular, a two-dimensional version of Theorem |^ above is given). 

• Taking the first difference sequences modulo 2 of 0, 1-sequences for studying factors 
and complexities of certain sequences was already used in 0, ^ |] for example. 

• What is the palindrome complexity of any sequence of complexity 2k7 Rote proves 
in that the fixed point u of the morphism —>■ 001, 1 — 111 has complexity 2k. 
With a slight modification of the argument in Theorem |^ below (this morphism is not 
primitive, but as soon as a word contains a 0, we essentially know from which word it 
"comes by the morphism") the reader can prove that pal„(fc) = 2 for each k > 1. On 
the other hand, looking at another example of a sequence of complexity 2k given in 
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1^, namely the image under the morphism a ^ 0, b ^ 1, c ^ 10110, d — 101 of the 
fixed point of the morphism a — ^ ad, b bac, c bacab, d baca, it is easy to check 
that this sequence contains only one palindrome of length 6. (More precisely, we have 
for this sequence pal(A;) = 2, for k = 1,2,3,4,5, pal(A;) = 1, for k = 6,7,8,9,10, and 
pal(fc) = for > 11.) 

Let V = vqVi ... be the fixed point of the morphism —>■ 001, 1 101. The complexity 
of this sequence is given by fa.Cv{k) = 2k for k > 1 (see PB|). The reader can prove that 



the palindrome complexity of this sequence is 2 for k < 7 and for A; > 8. (Hint: prove 
there is no palindrome of length 8 nor of length 9, either by mimicking the method 
of Theorem |^ above, or by using the recursive definition of the sequence v: u^n = m„, 
U2,n+i = 0, U3n+2 = 1-) the Same vein, the reader can prove that the Chacon sequence 
defined as the infinite fixed point of — 0010, 1 — 1 does not contain palindromes 
of length 13, nor palindromes of length 14, hence it does not contain palindromes of 
length > 13. Note that the (usual) complexity of the Chacon sequence is equal to 
2fc - 1 for A; > 2 (see [H]). 

We finally note that the (conjectured) palindrome complexity of the Kolakoski sequence 
is also constant and equal to 2 Section 4.1.3]. Recall that the Kolakoski sequence 
is the sequence 

221121221 ... 

defined as the sequence on the alphabet {1,2} that begins in 2 and such that the 



sequence of its runlengths is equal to the sequence itself (see 0], see for a recent 
survey). Recall that a word w on the alphabet {1, 2} is said to be differentiable if neither 
111 nor 222 occurs, and its derivative w' is the finite sequence of lengths of blocks in 
w, discarding the first and/or last block if it has length one: (12211)' = 22, (121)' = 1. 
All factors of Kolakoski are -words, i.e., words that can be differentiated infinitely 



many times. The converse is an open conjecture. It is proved in ^5], Section 4.1.3] 
that the number of C°°-palindromes of each length > 1 is 2. Hence the palindrome 
complexity of the Kolakoski sequence is bounded by 2, and it is conjecturally constant 
and equal to 2. 



6 Fixed points of uniform primitive morphisms 

We first recall the definition of class P morphisms introduced by Hof, Knill, and Simon in 

m- 

Definition 8 (Hof-Knill-Simon) A morphism a on the (finite) alphabet A belongs to 
class P if there exists a palindrome p and for every a & A a. palindrome qa such that, for 
every a G ^, we have a{a) = pqa (or, for every a & A, a{a) = Qap)- The word p can be 
empty. If p is not empty, then some (or even all) qaS are allowed to be empty. 

We show in the following theorem that it is possible to compute the palindrome com- 
plexity of fixed points of uniform morphisms that belong to class P. 
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Theorem 9 Let a : A ^ A* be primitive and u G A!^ such that cr{u) = u. We assume the 
following: 

(i) The morphism a belongs to class P: there exists a palindrome p and, for every a & A, 
a palindrome qa such that a{a) = pqa (the case where a{a) = qaP for every a E A is 
analogous). 

(a) The morphism a is uniform. Let Ip := \p\ and Iq := \qa\, for every a E A. 

(Hi) For a ^h, qa and q^ have distinct first (and hence last) symbols. 

Under these conditions we have the following recursion for the palindrome complexity 

3?2o G N \ {0}, Vn > no, pal(n) = ^ pal(A;), 

feef 

where 

£ = {s, n = si + Ip — 2j, where < j < / — 1}, with I := Ip + Ig. 

Proof. 

Let us first explain our idea intuitively. Given any palindrome x = Xi . . . Xn occurring in 
u, we can associate with x the palindrome a{x)p = pqxiPqx2 ■ ■ -PQxnP which is a factor of u 
since cr(M) = u. Moreover, we can simultaneously delete symbols at the beginning and end of 
cr{x)p, retaining the palindromic form, without deleting q^^ (and hence qx„) completely. Thus 
we can associate with x a family of palindromes. By assumption (iii), this map is one-to-one 
when restricted to the set of palindromes of length n occurring in u. Conversely, we can do 
an inverse procedure by re-substituting a given palindrome. That is, given a palindrome we 



can decompose it (essentially uniquely by a result of Mosse [^, 0) and consider its inverse 
image under a which will turn out to be a palindrome. Again by assumption (iii), this process 
is one-to-one and we can thus establish a bijection between suitable sets of palindromes. As 
a consequence, we get equality of their cardinalities which yields recursive relations for the 
palindrome complexity of u. 

Let us be more precise. Given n G N\{0} (this will be the length of the long palindrome), 
we are looking for solutions s in N \ {0} (this will be the length of the short palindrome the 
long palindrome is coming from) of the equation 

n = si + Ip — 2j subject to the condition < j < / — 1, with I = lp + Iq. (1) 

It is clear that there are at most two solutions to this problem, and if there are two solutions 
Si, S2, then / is even and |si — S2I = 1. We will show the following: 

There exists G N \ {0} such that for every n > Uq, we have 

pal(n) = J2 P^^(^)- (2) 

k solves Equation 1 

Let Pal(m) denote the set of palindromes of length m that are factors of u. We therefore 
have pal(m) = ^Pal{m). Let n G N \ {0}. Assume first that Equation has two solutions 
k,k + 1. We will define two maps 

$ : Pal(fc) U Pal(A; + 1) ^ Pal(n), ^ : Pal(n) ^ Pal(fc) U Pal(A; + 1) 
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and show that, for n large enough, they are one-to-one. Since Pal(A;) and Pal(A; + 1) are 
clearly disjoint, this imphes Equation |^. 

The map $ is defined as explained above. Start with a palindrome x in Pal(A;) UPal(A;+ 1) 
and consider the associated pahndrome a{x)p. Then, by Equation |], |o"(x)p| > n and we 
obtain an element of Pal(?7,) by "pruning" a{x)p suitably. Call this element $(0;). It follows 
from assumption (iii) and Equation |l| that $ restricted to either Pal(A;) or Pal(/c + 1) is 
one-to-one. To see that $ is one-to-one also on their union, we invoke the recognizability 



property of u as proven by Mosse, see |42, 43|. This gives the claim immediately in the case 



Ip 7^ Iq (for n large enough to apply Mosse's result and to compare two decompositions) and 



in the case Ip = Ig one uses that u is iV-th power-free for some together with this 
argument to prove the claim. 

The definition of \1/ is slightly more complicated. Let x G Pal(n) be given. According 
to Mosse's result, for n large enough, one can draw one "bar" and hence obtain a unique 
decomposition of x using assumption (ii). (Recall that a sequence u = (un)n>o fixed point 
of a morphism ip can be written 

u = (p{u) = I (p{uo) I ^{ux) I ... 

and any factor of u will "contain" bars - possibly in a non-unique way - according to 
its position(s) in the sequence u. Note that for a constant-length morphism, knowing the 
position of one bar gives the positions of the other bars.) We will consider a slightly different 
decomposition. Namely, we will also draw bars between the p's and the g^'s, a ^ A. It is 
easy to see that for n large enough, these modified decompositions are unique as well. 
We thus have 

X = a / wi / W2 / . . . / Wr / -n, (3) 

where Wi is equal to either p or one of the g^'s, 1 < z < r, a is a non-empty suffix and 
TT a non-empty prefix of p or of some g^. Since a; is a palindrome, the "refiected modified 
decomposition" 

X = T\ I Wr I Wr-1 / . . . / Wi / a (4) 

corresponds to another modified decomposition of x which must be the same by uniqueness. 
This shows that a = vr and Wi = Wr-i+i, 1 < i < r. Moreover, the center of x must also be 
a center of some Wj in this representation. Finally, we have Wi = p either for all odd i or for 
all even i. Let us consider the first case. We have 



x = (T /p/q^Jp/ q^Jp... I p I qx^l V I (IxJ V h (5) 

and we can associate some q^^^ with both o and tt (by assumption (iii)). We therefore get 
the palindrome XqX\X2 ■ ■ ■ X2X1XQ which belongs to either P{k) or P{k + 1), for otherwise 
Equation |^ would have a solution different from k,k + 1. In the second case we proceed 
similarly, in this case a/ir are suffix/prefix of p and we obtain a palindrome X1X2 ■ . ■X2X1 
which, by the same reasoning, belongs to P{k) or P{k + 1). Call the obtained palindrome 
\l/(x). Using assumption (iii) and the unique decomposition property, we see that \1/ is one- 
to-one. This establishes the existence of a bijection between Pal(A;) U Pal(A; + 1) and Pal(n) 
and hence proves the assertion of the theorem in the case where Equation |l] has two solutions. 
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If there is one solution to Equation |T|, we can prove Equation ^ along the same lines, 
parts of the argument being even simpler than in the two-solution case. 

On the other hand, if Pal(n) is non-empty, we can define as above and our re- 
substitution argument then shows that Equation |I] must have a solution. Hence the absence 
of such a solution implies Pal(n) = 0, that is, pal(?T,) = 0. □ 

Remark 7 This theorem applies to several prominent examples, such as the period doubling 
morphism {A = {a,b}, p = a, qa = b, qh = a) and the square of the Thue-Morse morphism 
{A = {a, b}, p = e, qa = abba, qf, = baab). It also applies to the square of some generalizations 
of the Thue-Morse morphism given in |^ (see Lemma 2 and Theorem 8 of § in the case 
where m G {1, 2}). 



We know from Theorem ^ that the palindrome complexity of a fixed point m of a primitive 
morphism takes only finitely many values. It then makes sense to ask whether the sequence 
(pal„(/c))A:>i itself is generated by a morphism. If the morphism belongs to class P, satisfies 
some technical conditions, and has constant length, we give an answer in our next theorem. 

Theorem 10 Let u = (M„)n>o be a sequence that is a fixed point of a primitive morphism 
belonging to class P and satisfying the extra conditions (i), (ii), (Hi) of Theorem |^. In 
particular the morphism is uniform: let d he its length. Then the palindrome complexity 
(pal„(A;))fc>i is a d-automatic sequence. 

Proof. 

Let i G [0, / — 1] be a fixed integer and let n be an integer > (n > 1 if i = 0). We want 
to compute pal„(/n + i) using Theorem ^ above. We distinguish two cases: 

• If / is odd, the congruence Ip — 2j = i mod / has a unique solution, say Jq, belonging 
to [0, 1 — 1]. Then, from Theorem ^ we have for n large enough 

pal„(/n + i) = pal^ (n + - — ^.l^—k 



Let /(n) := pal„(n — 1) for n > 2, then 



l + i-lp + 2 jo 



f{ln + ^ + l) = f\^n + f—^j ■ (6) 

We claim this implies that for a G [0, / — 1], the sequence {f{ln + a))n>max(no,2) is a 
linear combination of the sequences (/(n + p))n>max(no,2), with p G [0, 3]. 

This is clear for a G [1, / — 1] from Equation |] above. Using Equation ^ with i = I — 1, 
we obtain for n large enough. 



f{ln + l) = f(n + l + 



l-l-lp + 2 jo 
I 
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Hence replacing n by n — 1, 



f{ln) = f\n + 



I 

and the claim is proved. 

If I is even, the congruence Ip — 2j = i mod I either has no solution (if Ip and i have 
opposite parities) or has two solutions, say ji and j2, belonging to [0, / — 1]. In the first 
case pal„(/n + i) = 0. In the second case we have for n large enough 

p<«„ . ^ pa,„ f„ ^ i^i^ V pa J. + ' - ^ '^'^ 



We conclude as above that f{n) := pal„(n — 1) for n > 2 has the property that for 
every a G [0, / — 1], the sequence (/(/n + a))„>max(no,2) is a linear combination of the 
sequences (/(n + p))„>max(no,2), with p G [0,3]. 

Now from a result of quoted below as Theorem |T1|, we see that the sequence 
(and hence the sequence (pal„(n))„>i) is (i-regular in the sense of 0. But this sequence 
takes only finitely many values, hence as noted in ^ it must be (i-automatic. □ 

Before giving the next statement for the sake of completeness, we first recall (see 0) 
that a sequence u = (m„)„>o with values in a Noetherian ring R is called d-regular for some 
integer ci > 1 if the i?-module generated by all subsequences {udtn+i)n>o with t > and 
i < — 1 has finite dimension. The following result is proved in ||^. 

Theorem 11 (Allouche-Shallit) Let u = (un)n>o be a sequence with values in a Noethe- 
rian ring R. Suppose there exist an integer d > 2, an integer t > 0, an integer r > and an 
integer uq > such that each sequence (Md*+in+^)n>no for i G [0, d^^^ — 1] is a linear combi- 
nation of the sequences {udjn+i)n>no with j<t,i<d^ — 1 and of the sequences {un+p)n>no 
with p < r. Then the sequence u is d-regular. 



7 Bounding the palindrome complexity in terms of the 
usual complexity 

Looking at some of the examples above we can ask several questions. 

• We saw that Sturmian sequences, complement-symmetric Rote sequences, the (non- 
constant) fixed point of — > 001, 1 111, and fixed points of primitive morphisms all 
have bounded palindrome complexity. On the other hand, for all these sequences the 
complexity satisfies fac(A;) = 0{k). Is it true that the property fac(A;) = 0{k) implies 
pal(/c) = 0(1)? We will answer this question positively in Theorem |1^ below. Note 
that the converse is not true, by far. For example, apply to a binary sequence u with 
complexity fac„(A;) = 2^^ the morphism — 011001, 1 001011. Then the complexity 
faCt, of the image f of m under this morphism satisfies 2^^/^ < fac„(A;) < 9 ■ 2^/^, whereas 
the palindrome complexity drops to from k = 8. 
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More generally, for which sequences is it true that pal(A;) = 0{ia.c{k) /k)l For which 
sequences is it true that < limsup;j^Q^(/c pal(/c)/fac(A;)) < +00? 

Of course this last assertion is not true for all sequences: take the sequence of binary 
digits of a normal real number. The complexity of this sequence is fac(fc) = 2^ and its 
palindrome complexity is pal(fc) = 2L(^+^)/^J. This is not true for the Rudin-Shapiro 
sequence (Theorem ^) nor for the sequence given in the third item of Remark ^. For 
this question see also Remark below. 



Is there a Pansiot-like theorem |^ for the palindrome complexity of fixed points of 
morphisms? Combining with the previous question, is it true that, if the palindrome 
complexity of a fixed point of a morphism is not ultimately 0, then it satisfies 

< limsup(pal(A;)/(/?(A;)) < +00, 

fc— >oo 

where ^{k) is either 1, log log /c, \ogk or fc? 

This question can be first experimentally addressed by looking at all palindromes of 
reasonable lengths that occur in fixed points of various morphisms, getting an idea of 
what the order of magnitude of the palindrome complexity seems to be, and ... proving 
it. We give two examples. 

— we know that the complexity of the (infinite) fixed point u of the morphism 
001, 1 — i> 1 satisfies Cik"^ < facu{k) < C2k'^ If we look for maximal 



palindromes in the sequence, i.e., factors of the sequence that are palindromes 
and that cannot be extended to longer palindromes occurring in the sequence, we 
find they are given by the recurrence wq = 0, Wm+i = 1-C"(wm). (Note that by 
"the palindrome w can be extended to a palindrome occurring in the sequence" we 
mean of course that there exists a letter a such that awa occurs in the sequence.) It 
can be proved that pal(/c) is the number of integers m such that m < k + 1, \wm\ = 
2'""'"^+m— 1 > k, and m— A; is odd, which implies pal(A;) = 1/2 log2(fc)+0(l). 

In the same vein, the complexity of the fixed point beginning in of —> 010, 
1 — 11 satisfies fac(/c) ~ fclogg log2(fc) (see |jl5|]). The behavior of the palindrome 



complexity of this sequence is different for odd and even length: pal(2n + 1) = 1 
for n > 4, whereas pal(2n) ~ log2 log2(2rz). 

Before we state the main theorem of this section, we need a definition and a preliminary 
result. 

Definition 9 Let w be a palindrome on the alphabet A, and let T be its period. 

• If T > |w|/2, the palindrome w is called non-periodic. 

• If T < |w|/2 and T is odd, the palindrome w is called a palindrome of odd period. 

• U T < \w\/2 and T is even, the palindrome w is called a palindrome of even period. 
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Remark 8 In this paper we thus call periodic a palindrome whose period (even or odd) 
is at most half of the length of the palindrome. This means in particular that a periodic 
palindrome is a palindrome that can a priori be written as A'^B, where 5 is a prefix of A 
with \B\ < \A\, and necessarily d > 2. It is not hard to see that the word w is a periodic 
palindrome if and only if there exist two palindromes B and C, and an integer d > 2, such 
that w = {BCYB (in the decomposition w = A'^B, remember that i? is a prefix of A, put 
A = BC, and compare the prefixes of length + |C| of w and w). 

Lemma 2 Let w be a palindrome of even period T. The word w is hence a prefix of an 
infinite sequence {xyxyxy . . .) of smallest period T , with \x\ = \y\ = T/2 < \w\/4. The prefix 
of length \w\ of the sequence {yxyxyx . . .) is denoted by w® and called the twin of w. Then 

• the word w® is a periodic palindrome and its period is T . Furthermore w® ^ w; 

• "taking the twin" is an involution. More precisely the map w w® is an involution 
on the set of palindromes of even period. 

Proof. 

It is clear that w and have the same period, and that the map w is an 

involution. Note that w ^ w®, otherwise x = y and T/2 would be a period of w. 

It remains to prove that w® is a palindrome. Let us write w = {xy^z, with |x| = \y\ = 
T/2, d > 2, and z prefix of xy with \z\ < \xy\. Write xy = zt (with t non-empty). As in 
Remark the words z and t must be palindromes. Hence tz = tz = zt = xy = yx. Hence 
the word yzy, which is a prefix of yzyx, is a prefix of yztz = yxyz, hence a prefix of yxyxy. 
This shows that the word y{xyY~^zy = {yxY^^yzy is a prefix of the sequence {yxyxyx . . .). 
Since the length of y{xyY^^zy is equal to \w\, we thus have an alternative definition of w®: 

if w is a palindrome of even period T {T < \w\/2), with w = [xy^z, where |x| = \y\ = 
T/2, d> 2, z prefix of xy, and \z\ < \xy\, then 

w® = y{xyY~^zy. 

Now, with the notations above, and remembering that z and t are palindromes, 

= yz{^Y~^y = yz{ztY~^y = yz{tzY~^y = y{ztY~^zy = y{xyY~^zy = w®. □ 



We now give a theorem that bounds the palindrome complexity in terms of the usual 
complexity, and that answers positively the first question at the beginning of this section. 

Theorem 12 Let u = UqUiU2 ... be an infinite non-ultimately periodic sequence on a finite 
alphabet. Then, for all k > 1, we have 

pal„(A;) < ^ fac„ (k + [tJ ) • 
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Proof. 

We first suppose that the sequence u is recurrent or that it is indexed by Z (m = 
. . . M_2M-iMoMiM2 • • •)• the latter case we suppose the sequence is not ultimately peri- 
odic "on the right". Let > 1 be a fixed integer. We split Pa\u{k) the set of palindromes of 
length k occurring in the sequence u into three classes according to their periods T: 

Palf (^) := {w e Faluik), T > k/2}, 

(this is the set of non-periodic palindromes of length k occurring in u), 

PalW(A;) ■={we Fa\u{k), T < k/2 and T odd}, 

(this is the set of palindromes of length k and of odd period occurring in u), 

Pal^^)(A;) := {w E Pal„(A;), T < k/2 and T even}, 

(this is the set of palindromes of length k and of even period occurring in u). 

• For each w G Paljf''(fc), i.e., for each non-periodic palindrome w of length k in the 
sequence u we choose an index i (assuming furthermore, which is possible, that i > 
[_k/4\ if the sequence is recurrent) such that an occurrence of w in m begins at index 
i. Then, we associate with w the language S{w) that consists of the ([fc/4j + 1) words 
of length {k + [fc/4j) that occur in u and begin at indexes between {£ — [fc/4j) and i. 
(Note that these indexes are nonnegative in the case where the sequence is recurrent, 
and well-defined if the sequence is indexed by Z.) These words are all distinct; namely 
if the words beginning at indexes i—i and i—j (where < i,j < [_k/A\) were equal, the 
word w would have a period < — < k/A < k/2 (use Theorem 0), which contradicts 
the non-periodicity of w. We thus have 

Ww G Pali°)(A;), 4^(3 (w)) = [k/A\ + 1. 

• For each w G Pal^^^(A;), i.e., for each palindrome w of length k occurring in the sequence 
u and having an odd period T, we choose an index i (assuming furthermore, which is 
possible, that £ > [A;/4J if the sequence is recurrent) such that an occurrence of w in 
u begins at index £, and such that the word of length \w\ beginning at index £ + T is 
different from w (this is possible since the sequence u is not ultimately periodic). We 
define S{w) as above. The words in S{w) are pairwise distinct: if the words of S{w) 
beginning at i—i and i—j, say Zi and zj, (where < i,j < [_k/A\) were equal, then there 
would exist two words A and B of length \ j — i\ such that Az = zB, where 

From Theorem ^ this implies the existence of two words 7 and 5 and of an integer e > 
such that A = •yd, B = ^7, and z = (75)^7. Hence Az = zB is a prefix of a periodic 
sequence of period \'~^5\ = \j — i\. Since w is a factor of Az = zB (this is the factor of 
length k beginning at index i in the sequence u, and Az = zB is the factor of length 
k+ [_k/4\ beginning at index £— max(i, j) in the sequence u), this implies from Lemma |1] 
that Az has period T (note that \j - i\ + T < j + i + k/2 < 2[k/A\ +k/2<k = \w\). 
Now, the factor of the sequence u of length k beginning at index £ + T is a factor of 
Az since T < \j — i\ (namely \j — i\ is a period of Az = zB, which contains w, hence 
|j — is a period of w, and by Lemma |l] it is a multiple of T, so T < \j — i\) hence is 
equal to the factor of length k beginning at i, i.e., to w, which gives a contradiction. 
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• We consider now the palindromes of even period. For each w G PaA^^\k), i.e., for each 
pahndrome w of length k and of even period T (remember that means in particular that 
T < k/2), we consider its twin (see Lemma Since the map w — > w;® is involutive 
on the set of palindromes of even period, and w® ^ w (Lemma |^), palindromes of even 
period can be grouped in pairs of twins. For each pair of twins {w, w;®) of length k 
and (even) period T, such that at least one of w, occurs in the sequence u, we 
choose an index i (and we can suppose i > \_k/4\ in the case where u is recurrent) 
such that one of the words w, w® occurs in u at index i, and such that its twin does not 
occur at index i + T/2, this is possible since the sequence u is not ultimately periodic: 
if it were true that for every index £ such that the word w (resp. w®) occurs in the 
sequence u at index i, then the word w® (resp. w) would occur in the sequence u at 
index i + T/2, then both words w and w® would occur in u, and, for every index i 
such that w occurs at i, w would also occur at index i + T, which would imply that u 
is ultimately periodic. We construct the set S{w) (or S'(w®)) as above, and the words 
in S{w) (or S'(w®)) are again pairwise distinct. 

We thus constructed at least #Pali°)(A;) + i^Pal^^\k) + #Pal(f)(A;)/2 > pal„(fc)/2 lan- 
guages, each of which contains [fc/4j +1 words of length k+[k/A\ that are distinct. We prove 
now that these sets are pairwise disjoint. If there exists a word z belonging to S{wi) fl S{w2) 
(hence \z\ = k + [_k/A\), then there exist indexes ii and £2, nonnegative integers ii and ^2 
both < [fc/4j, such that wi occurs at index £1 in the sequence u, W2 occurs at index £2 in 
u, and z occurs in u both at indexes £1 — ii and £2 — 12- Then Wi occurs at position ii in 
z and W2 at position 12 in z. Let h := 12 — ii (assuming ^2 > "^i)- If = 0, then Wi = W2- 
Otherwise we have h > 0. Then the prefix of length k — h of W2 is equal to the suffix of 
length k — h of Wi. We write wi = At and W2 = tB, where \t\ = k — h and 1^41 = \B\ = h. 
Now let z' := AtB be the factor of z of length k + h obtained by "superposing" wi and W2. 
Since Wi and W2 are both palindromes, we have: 

{AB)t = A{Bt) = A{tB) = z' = {At)B = {tA)B = t{AB). 

Hence, from Theorem ^ there exist two words C and D and an integer g > such that 
t= {CD)iC, AB = CD, and AB = DC. Hence z' = {CD)'}+^C, and \CD\ = \AB\ = 2h 
is a period of z'. In particular 2h is a period of wi. Since 2h < 2x2 < 2\_k/A\ < k/2, the 
word Wi is a periodic palindrome whose period, say T, divides 2h: this is a consequence of 
Lemma |l] which also gives that T is a period of z' and hence of W2- 

• If T divides h, then the equality z' = ABt shows that A = B (remember that \A\ = 
\B\ = h). Hence Wi = At and W2 = tB = Bt coincide on their prefixes of length 
\A\ = h, hence Wi = W2- 

• If T does not divide h, let AB = E^' , with \E\ = T and r > 1. Since 2h = \AB\ = 
\E'''\ = rT and T does not divide h, T must be even, and r must be odd. Let E = xy, 
with |x| = \y\ = T/2. We have AB = {xyY , which implies, since r is odd, that 
A = (xy)^^^^'^^'^x and B = y(xy)^^^^^^'^. The word xy is a prefix of AB, hence of 
ABt = z'. Since wi is a prefix of z', and since \wi\ > 2T = 2\xy\, we see that xy is a 
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prefix of Wi. Finally Wi is the prefix of length k of the sequence {xyxyxy . . .) and the 
smallest period of this sequence is T. 

Now BW = B{CDy+^^= ?/(a;?/)^'(«+i)+(''-i)/2_ Hence yx is a prefix of BtD. Since 
W2 = Bt is a prefix of BtD, and \w2\ > 2|xy|, the word yx is a prefix of W2- Now 
T = |a;y| is a period of W2, hence we see that W2 is the prefix of length k of the 
sequence [yxyxyx . . .). Since the smallest period of this sequence must be T (as for the 
sequence [xyxyxy . . .)), W2 is the twin of Wi, which is a contradiction, since we chose 
exactly one representative in each pair of twins. 

We thus have 

P^i|^(LA:/4j+l)<fac„(A;+LA;/4j). 
Since L^/4J +1 > we have that 

pal„(A;) < ^ fac„ [^J j . 



Now, if we take a non- ultimately periodic sequence u = (M„,)n,>o on an alphabet A and 
if u is not recurrent, let be a letter that does not belong to A. Define the sequence 
u* = {uDn£i on Z by u* = m„, if n > and u* = for < 0. We clearly have: 

pal„(fc) = pal„, {k)-l <l fac„. + L^J ) " 1 = ^ (fac.(fc + L^J ) + (A^ + L^J )) - 1 

<^facJ/c+L^j]. □ 



Remark 9 



As a corollary of Theorem |T2| above, we see that a sequence such that fac(A;) = 0{k) has 
bounded palindrome complexity. This result can be compared to a result of Cassaigne 
[Q: if the complexity of a sequence satisfies fac(A;) = 0{k), then (fac(fc + 1) — fac(A;)) = 
0(1). 

In particular any automatic sequence has bounded palindrome complexity, and any 
fixed point of a primitive morphism has bounded complexity (thus recovering Theo- 
rem |^). Namely the (block)-complexity of an automatic sequence is 0{k) [0, and the 
block complexity of a fixed point of a primitive morphism is also 0{k) [^, ^ . 



In view of Theorem |T2|, a natural question is: is there a universal upper bound for the 
quantity k pal(/c)/fac(fc)? (The quantity fac(/c + L/c/4j) instead of fac(fc) seems to ap- 
pear only for technical reasons). The answer is no, as shown by the following example, 
for which k pal(/c)/fac(fc) reaches \/k/A for certain values of k, while fac(A;) = 0{k^^'^); 
We only outline the proof. Start with = 1, and define 
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where Xi = 0^ """^"^^WjO^ Then define the sequence u as the hmit of wj 

when j tends to infinity. We have 

Wo = 1, wi = 10011, 

W2 = 100110000000000000011001000000000000100110000000000110010000000010011 
... (I^sl = 4997). It can be proved that 

\w,\r^2'-''-\ fac„(22^)~ 22^+2, pal„(2^^) = 22^'"' + 1. 

Hence logfac„(A;)/logA; oscillates between 1 and 3/2, while logpal„(A;)/ log/c oscillates 
between and 1/2, but these two quantities are not "in phase" so that their difference 
oscillates between 1/2 and 3/2. Note that it might be the case that ^/k pal(/c)/fac(A;) 
is still universally bounded. 

On the other hand, the constant 1/4 in the theorem can be changed, so that the 
quantity k pal(fc)/fac(fc + [ak\) is universally bounded for any fixed a > 0, the bound 
depending of course of a. 



C. Choffrut |[T6[ asked the following nice question: is it true that 

pal„(fc) = 0(v/fac,(fc))? 

Note that this is true for sequences u with maximal complexity, or even for sequences 
with complexity Q{a'^k~°-) for some a > 1 and a > (use the easy bound pal„(fc) < 
facu([^iij). This is also true from Theorem |T2| above if facu(/c) = 0{k^^'^), or if 
faCu(A;) = 0(A;^). Hence, using Pansiot's theorem [^, this is true for fixed points of 
non-trivial morphisms. 



8 On a question of Hof, Knill, and Simon 



To end this paper, we recall a question of Hof, Knill, and Simon in [^2|, Remark 3, p. 
153]: are there (minimal) sequences containing arbitrarily long palindromes that arise from 
substitutions none of which belongs to class P? This question is still open. But we prove 
in Theorem |13| below that we can restrict ourselves to particular substitutions in class P, as 
well as to nonperiodic (minimal) sequences. We first prove the following result. 

Lemma 3 Let u be a fixed point of a primitive morphism a on the alphabet A. Suppose that 
there exists a non-empty word x that is a prefix of a (a) for all a ^ A (resp. a suffix of a (a) 
for all a G A). Write a{a) = xZa (resp. a{a) = Zax). Let be the morphism defined by 
o'#{a) = ZaX (resp. cr^{a) = xZa). Then cr# is primitive, and any fixed point v of a power of 
cr# (there always exists at least one such fixed point) has the same factors as u. 
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Proof. 

The morphism is clearly primitive. Suppose now, for example, that a{a) = xZa for all 
a ^ A. Then, for all a G ^ we have xa^{a) = a{a)x. Hence, for any word w on A, we have 
xa^{w) = a{w)x. Taking for w prefixes of increasing length of u, we get a limit when this 
length goes to infinity: xa^{u) = a{u) = u. Hence, xcr#(x)cr|^(x) . . . = u. From this equality 
we see that, calling a the first letter of x (remember that x is not empty), any factor of u 
is a factor of cr;^(«) for j > Jq (remember that u is minimal, since a is primitive). Trivially 
any factor of cr^(a) for any j is a factor of u. Now there exist integers i > such that cx^ 
admits a fixed point, say v. For any such i it is clear from what precedes that v and u have 
the same factors. The case where a; is a suffix of a{a) for all a is similar. □ 

Theorem 13 

• Let u be a sequence that is a fixed point of a primitive morphism in class P. Then the 
set of factors ofu is the same as the set of factors of a sequence that is a fixed point 
of a primitive morphism in class P where the palindrome p occurring in Definition ^ 
has length or 1. We can even choose one of the two forms a — * pqa for all a, or 
a — > qaP fo all a. 

• Let u be a periodic sequence that contains arbitrarily long palindromes, then u is a fixed 
point of a morphism in class P. 

Proof. 

Let M be a sequence that is a fixed point of the primitive morphism a on the alphabet A. 
Suppose there exists a palindrome p and, for any a E A a palindrome g^, with a (a) = pqa- 



If p is empty, the first assertion of Theorem |T3] is satisfied. 

If p has even length, let p = rr for some word r. Define the morphism by cr^{a) := rg^r 
for all a E A. Applying Lemma ^ with x = r, we know that this morphism is primitive; 
furthermore there exists an integer i > such that admits a fixed point, say v, and the 
sequences v and u have the same factors. Finally, the definition of cr# shows that the image 
of any letter is a palindrome, hence that the image of any palindrome is also a palindrome. 
This proves that the image of any letter by is a palindrome. In other words, belongs 
to class P, and the corresponding palindrome p is empty. 

If p has odd length, write p = rbr for some letter b and some word r. Define, for all 
a e ^, (T# by (J#{a) := brqaV. Applying Lemma || with x = r, we mimic the proof just 
above in the case where the length of p is even, leading to a morphism in class P, whose 
corresponding palindrome p has length 1. 

Finally, if the morphism has, for example, the form a pqa for all a, we can apply 
Lemma ^ with x = p to obtain a morphism of the form a — > qaP for all a. 

We now prove that the answer to the question of Hof, Knill and Simon is negative for 
a periodic sequence (that is a fixed point of a primitive morphism). Let u = www ... be a 
periodic sequence that contains arbitrarily long palindromes. Let s be a palindromic factor 
of u such that |s| > 2\w\. We can write s = xw^y, where A; > 1, and < \y\ < \w\. Since 
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s is a palindrome, we have s = y{w)^x. Hence w is a factor of s, hence a factor of u. Thus w 
must be a factor of ww. Let ww = AwB. Since |u;| = \w\, we see that \w\ = \A\ + \B\, then 
we must have w = AB {A is a prefix of w and i? is a suffix of w). The equahty ww = AwB 
then imphes that w = BA. Hence w = AB, which shows that A and B are pahndromes. We 
conclude by noting that the sequence u = ww ... is a fixed point of the morphism r defined 
by, for all a G r(a) := w = AB that is in class P. □ 
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